DOCUMENT PROCESSING APPARATUS AND 
DOCUMENT PROCESSING METHOD 

BACKGROUND OF THE INVENTION 
The present invention relates to a document 
processing apparatus and method which analyzes a 
tagged document, e.g., a Hyper Text Markup Language 
(HTML) document and forms another tagged document 
containing original sentences and translated sentences. 
The present invention also relates to a recording 
medium for recording such a tagged document. 

With the recent proliferation of personal 
computers and communication apparatus, people have 
become able to use communication networks represented 
by the Internet, i.e., Internet protocol (IP) 
communication networks, and to easily obtain various 
sorts of information through the networks. Ordinarily, 
the World Wide Web in the Internet generally uses HTML 
as a language for describing information. Dynamic HTML 
(DHTML) and Extensible Markup Language (XML) are other 
languages presently used to form tagged documents. 

Conventionally, to form home pages (also called 
Web pages) containing the same information described 
in different languages, e.g., English and Japanese, 
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the process of separately forming each home page is 
required. That is, the steps of forming sentences 
separately in each language, pasting common images 
including graphs and figures, separately setting links 
from each of the English home page and the Japanese 
home page, etc., are required. 

Home pages are open to the public on the Internet 
and can be read by people everywhere in the world. 
Therefore, people who set up their home pages for 
various readers to read are making home pages having 
the same format and contents but having sentences 
written in different languages. 

The English and Japanese home pages formed as 
described above need scrupulous attention for 
maintenance because their English and Japanese 
sentences after correction must be equivalent in 
meaning and format. 

To form such English and Japanese home pages, it 
is necessary to make and manage two kinds of HTML 
document files for the English home page and the 
Japanese home page. That is, the number of files to be 
managed is increased in proportion to the number of 
languages, and management and maintenance becomes 
difficult . 
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A user who wishes to read document information in 
an English home page in Japanese may translate the 
English document information by using Internet 
translation software. However, if the user wishes to 
edit the results of the translation displayed as an 
HTML document, he or she must give up the idea of 
doing so or is further required to translate the 
translation results by different translation software, 
because direct editing of the translated HTML document 
is impossible . 

If the user dares to edit the translation-result 
HTML document, he or she must perform the steps of 
storing the translation-result HTML document on a 
local disk, opening the HTML document file stored on 
the local disk by using HTML document editing software, 
displaying the HTML document source, directly editing 
the HTML document source, and storing the results of 
the editing on the local disk. This process enables 
editing of the translation results to some effect. 
However, it is difficult to edit a document in which 
HTML tags, original sentences, and translated 
sentences are mixed. 

Further, in a case where an HTML document 
intended as an object of translation is prepared in 
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advance and, from this document, another HTML document 
described in a different language is formed by- 
translation processing using Internet translation 
software, a need may arise to edit the HTML document 
in the second language formed by the translation 
processing and, if necessary, the translation-object 
HTML document, if the author of the HTML document is 
not satisfied with the results of the translation. 

In this editing, it is difficult to determine 
document portions to be edited and to confirm the 
correspondence between original and translated 
sentences, since the translation-object HTML document 
and the translation-result HTML document exist in 
separate files. It is also possible that, through 
editing, the page configuration (format) of one 
document will become different from that of the other. 

As described above, the conventional HTML 
document processing apparatus can be designed to 
enable translation of an original home page on the 
Internet using Internet translation software and 
visual display of original and translated sentences in 
a juxtaposed form. However, in editing translation 
results, an HTML document itself cannot be edited. 
There is a way to directly edit the HTML document 
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source, but editing in such a way is extremely 
troublesome and not satisfactorily effective. 

SUMMARY OF THE INVENTION 
In view of the above-described circumstances, an 
object of the present invention is to provide a 
document processing apparatus and a recording medium 
which make it possible to easily form and maintain a 
home page (Web page data) expressed in two or more 
languages . 

Another object of the present invention is to 
provide a document processing apparatus and a 
recording medium which make it possible to easily edit 
translated sentences obtained as a result of 
translation of a tagged document. 

Still another object of the present invention is 
to provide a document processing apparatus and a 
recording medium which make it possible to selectively 
display original sentences in a tagged document and 
translated sentences obtained as a result of 
translation of the tagged document. 

A further object of the present invention is to 
provide a document processing apparatus and a 
recording medium which make it possible to return the 
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edited translation sentences of the result of 
translation into a tagged document and to use it. 

A further object of the present invention is to 
provide a document processing apparatus and a 
recording medium which make it possible to easily edit 
original sentences in a tagged document to from a more 
favorable translation. 

To achieve the above-described objects, according 
to one aspect of the present invention, there is 
provided a document processing apparatus comprising a 
language tag setting unit for setting a language tag 
designating a kind of language at each of constituent 
unit positions in an original and those in a 
translated version corresponding to the original, and 
a document forming unit for forming a tagged document 
including the original and the translated version each 
having the language tag set therein. 

The above-described document forming unit may- 
form a tagged document in which the original and the 
translated version each having the language tag set 
therein are described in an original- versus- vers ion 
form . 

The above-described document processing apparatus 
further comprises a translation unit for translating 
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the original to make the corresponding translated 
vers ion . 

The original may be contained in the processing- 
object tagged document. 

The above-described document processing apparatus 
further comprises a visibility control tag setting 
unit for setting a visibility control tag for 
controlling any document portion so that the document 
portion is invisible, the visibility control tag 
setting unit setting the visibility control tag at 
such a position that one of the original and the 
translated version is in an invisible state. 

The visibility control tag setting unit may set 
the visibility control tag at such a position that one 
of the original and the translated version is set in 
an invisible state and the language tag is also set in 
an invisible state. 

The above-described document processing apparatus 
further comprises a display processing unit for 
interpreting the tag contained in the tagged document 
and for enabling a browser for displaying the tagged 
document to display the formed tagged document on the 
basis of a result of the interpretation in the above 
document processing apparatus. 
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According to another aspect of the present 
invention, there is provided a document processing 
apparatus comprising a language tag setting unit for 
setting a language tag at each of constituent unit 
positions in a first text described in a first 
language and those in a second text described in a 
second language of the first text, and a document 
forming unit for forming a tagged document including 
the first text and the second text each having the 
language tag set therein. 

The above-described document forming unit may- 
form a tagged document in which the first text and the 
corresponding second text each having the language tag 
set therein are described by being related to each 
other . 

The above-described document processing apparatus 
further comprises a visibility control tag setting 
unit for setting a visibility control tag for 
controlling a text so that the text is invisible, the 
visibility control tag setting unit setting the 
visibility control tag at such a position that one of 
the first text and the second text is in an invisible 
state . 

The visibility tag setting unit may set the 
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visibility control tag at such a position that one of 
the first text and the second text is set in an 
invisible state and the language tag is also set in an 
invisible state. 

The above-described document processing apparatus 
further comprises a display processing unit for 
interpreting the tag contained in the tagged document 
and for enabling a browser for displaying the tagged 
document to display the formed tagged document on the 
basis of a result of the interpretation. 

According to still another aspect of the present 
invention, there is provided a document processing 
apparatus comprising an analysis unit for determining 
and extracting an original text from a processing- 
object tagged document, and an editing unit for 
enabling edit processing of the original text 
extracted from the tagged document by displaying the 
original text. 

The analysis unit may determine the original text 
according to a language tag contained in the 
processing-object tagged document. 

The above-described document processing apparatus 
further comprises a translation processing unit for 
making a translated version by translating the 
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original text edited by the editing unit. 

The above-described document processing apparatus 
further comprises a restoration unit for restoring, 
from the original text after the editing and the 
translated version made by the translation processing 
unit, a tagged document in the same format as the 
processing-object tagged document according to tags 
contained in the processing-object tagged document, 
the tags including the language tag. 

According to yet another aspect of the present 
invention, there is provided a document processing 
apparatus comprising an analysis unit for determining 
an original text and a translated version 
corresponding to the original text in a processing- 
object tagged document, and an editing unit for 
enabling edit processing of the original text and the 
translated version by extracting the original text and 
the translated version from the tagged document and by 
displaying the original text and the translated 
version in an or iginal-ver sus-ver s ion form. 

The analysis unit may determine the original text 
and the translated version according to language tags 
contained in the processing-object tagged document. 

The above-described document processing apparatus 
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further comprises a translation processing unit for 
making a translated version by translating the 
original text edited by the editing unit. 

The above-described document processing apparatus 
further comprises a restoration unit for restoring, 
from the original text after the editing and the 
translated version made by the translation processing 
unit, a tagged document in the same format as the 
processing-object tagged document according to tags 
contained in the processing-object tagged document, 
the tags including the language tag. 

According to a further aspect of the present 
invention, there is provided a recording medium 
readable by a computer, the recording medium having a 
program recorded thereon, the program for enabling the 
computer to execute a step of setting a language tag 
designating a kind of language at each of constituent 
unit positions in an original and those in a 
translated version corresponding to the original, and 
a step of forming a tagged document including the 
original and the translated version each having the 
language tag set therein. 

The program recorded on this recording medium may 
enable the computer to further execute a step of 
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setting a visibility control tag for controlling any 
document portion so that the document portion is 
invisible, the visibility control tag being set at 
such a position that one of the original and the 
translated version is in an invisible state. 

According to still a further aspect of the 
present invention, there is provided a recording 
medium readable by a computer, the recording medium 
having a program recorded thereon, the program for 
enabling the computer to execute a step of setting a 
language tag designating a kind of language at each of 
constituent unit positions in a first text described 
in a first language and those in a second text 
described in a second language of the first text, and 
a step of forming a tagged document including the 
first text and the second text each having the 
language tag set therein. 

The program recorded on this recording medium may 
enable the computer to further execute a step of 
setting a visibility control tag for controlling a 
text so that the document is invisible, the visibility 
control tag being set at such a position that one of 
the first text and the second text is in an invisible 
state . 
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According to still a further aspect of the 
present invention, there is provided a recording 
medium readable by a computer, the recording medium 
having a program recorded thereon, the program for 
enabling the computer to execute a step of determining 
and extracting an original text from a processing- 
object tagged document, and a step of enabling editing 
of the text extracted from the tagged document by 
displaying the original text. 

The program recorded on this recording medium may 
enable the computer to further execute a step of 
making a translated version by translating the 
original text after editing of the original text, and 
a step of restoring, from the original text after the 
editing and the translated version made by the 
translation, a tagged document in the same format as 
the processing-object tagged document according to 
tags contained in the processing-object tagged 
document, the tags including a language tag 
designating a kind of language. 

According to still a further aspect of the 
present invention, there is provided a recording 
medium readable by a computer, the recording medium 
having a program recorded thereon, the program for 
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enabling the computer to execute a step of determining 
an original text and a translated version 
corresponding to the original text in a processing- 
object tagged document, and a step of enabling editing 
of the original text and the translated version by 
extracting the original text and the translated 
version from the tagged document and by displaying the 
original text and the translated version in an 
or iginal -versus -vers ion form . 

The program recorded on this recording medium may 
enable the computer to further execute a step of 
forming a translated version by translating the 
original text after editing of the original text, and 
a step of restoring, from the original text after the 
editing and the translated version made by the 
translation, a tagged document in the same format as 
the processing-object tagged document according to 
tags contained in the processing-object tagged 
document, the tags including a language tag 
designating a kind of language. 

According to still a further aspect of the 
present invention, there is provided a recording 
medium readable by a computer, the recording medium 
having a tagged document recorded thereon, the 
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recorded tagged document comprising a first language 
tag designating a first kind of language; a first text 
following the first language tag, the first text being 
described in a first language; a second language tag 
following the first text, the second language tag 
designating a second kind of language; and a second 
text following the second language tag, the second 
text being described in a second language and 
corresponding to the first text. 

According to still a further aspect of the 
present invention, there is provided a recording 
medium readable by a computer, the recording medium 
having a tagged document recorded thereon, the 
recorded tagged document comprising an invisibility 
start tag for setting a following text in an invisible 
state; a first language tag following the invisibility 
start tag, the first language tag designating a first 
kind of language; a first text following the first 
language tag, the first text being described in a 
first language; a second language tag following the 
first text, the second language tag designating a 
second kind of language; an invisibility end tag 
following the second language tag, the invisibility 
end tag canceling the invisible state; and a second 
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text following the invisibility end tag, the second 
text being described in a second language and 
corresponding to the first text. 

According to a further aspect of the present 
invention, there is provided a document processing 
method comprising a step of setting a language tag 
designating a kind of language at each of constituent 
unit positions in an original and those in a 
translated version corresponding to the original; and 
a step of forming a tagged document including the 
original and the translated version each having the 
language tag set therein. 

The above-described document processing method 
further comprises a step of setting a visibility 
control tag for controlling any document portion so 
that the document portion is invisible, said 
visibility control tag being set at such a position 
that one of the original and the translated version i 
in an invisible state. 

According to still a further aspect of the 
present invention, there is provided a document 
processing method comprising a step of setting a 
language tag designating a kind of language at each o 
constituent unit positions in a first text described 
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in a first language and those in a second text 
described in a second language of the first text; and 
a step of forming a tagged document including the 
first text and the second text each having the 
language tag set therein. 

The above-described document processing method 
further comprises a step of setting a visibility 
control tag for controlling a text so that the 
document is invisible, said visibility control tag 
being set at such a position that one of the first 
text and the second text is in an invisible state. 

According to still a further aspect of the 
present invention, there is provided a document 
processing method comprising a step of determining and 
extracting an original text from a processing-object 
tagged document; and a step of enabling editing of the 
original text extracted from the tagged document by 
displaying the original text. 

The above-described document processing method 
further comprises a step of making a translated 
version by translating the original text after editing 
of the original text; and a step of restoring, from 
the original text after the edit processing and the 
translated version formed by said translation, a 
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tagged document in the same format as the processing- 
object tagged document according to tags contained in 
the processing-object tagged document, said tags 
including a language tag designating a kind of 
language . 

According to still a further aspect of the 
present invention, there is provided a document 
processing method comprising a step of determining an 
original text and a translated version corresponding 
to the original text in a processing-object tagged 
document; and a step of enabling edit processing of 
the original text and the translated version by 
extracting the original text and the translated 
version from the tagged document and by displaying the 
original text and the translated version in an 
original-versus-version form . 

The above-described document processing method 
further comprises a step of forming a translated 
version by translating the original text after editing 
of the original text; and a step of restoring, from 
the original text after the edit processing and the 
translated version made by said translation, a tagged 
document in the same format as the processing-object 
tagged document, according to tags contained in the 
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processing-object tagged document, said tags including 
a language tag designating a kind of language. 

According to the present invention, it is not 
necessary to form and manage a file with respect to 
each of a plurality of languages, so that the 
maintenance can be easily carried out. 

According to the present invention, an original 
text and a translated version of the original text 
obtained by translation are displayed in an original- 
ver sus-ver s ion form such that edit processing and 
version can be easily carried out. 

According to the present invention, an original 
text in a tagged document and a translated version of 
the- original text obtained by translation can be 
selectively displayed. 

Further, according to the present invention, an 
edited translated version of the result of translation 
can be reformatted to the tagged document and used. 

Further, according to the present invention, an 
original text in a tagged document can be easily 
edited to obtain a more favorable translated version. 

BRIEF DESCRIPTION OF THE DRAWINGS 
These objects and advantages of the present 
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invention will become more apparent and more readily 
appreciated from the following detailed description of 
the presently preferred exemplary embodiments, taken 
in conjunction with the accompanying drawings of 
which : 

Fig. 1 is a block diagram showing the 
configuration of an HTML document processing apparatus 
which represents a first embodiment of the present 
invention ; 

Fig. 2 is a diagram schematically showing the 
format and contents of an HTML document; 

Fig. 3 is a diagram explaining a state where HTML 
documents are combined into one file and stored, and 
an edited state by a translation editor; 

Fig. 4 is a flowchart for explaining the 
operation of the HTML document processing apparatus of 
the first embodiment; 

Fig. 5 is another flowchart for explaining the 
operation of the HTML document processing apparatus of 
the first embodiment; 

Fig. 6 is a diagram showing a specific example of 
an original-versus-version HTML document source; 

Fig. 7 is a block diagram showing the 
configuration of an HTML document processing apparatus 
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which represents second and third embodiments of the 
present invention ; 

Fig. 8 is a flowchart for explaining the 
operation of the HTML document processing apparatus of 
the second embodiment; 

Fig. 9 is another flowchart for explaining the 
operation of the HTML document processing apparatus of 
the second embodiment; 

Fig. 10 is a diagram showing an example of an 
initial window of an English- Japanese translation 
editor (or iginal-ver sus-ver s ion editor program) ; 

Fig. 11 is a flowchart for explaining the 
operation of the HTML document processing apparatus of 
the third embodiment; and 

Fig. 12 is another flowchart for explaining the 
operation of the HTML document processing apparatus of 
the third embodiment. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Embodiments of the present invention will be 
described with reference to the accompanying drawings . 
[First Embodiment ] 

[Configuration of HTML Document Processing 
Apparatus ] 
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Fig. 1 shows functional components of an HTML 
document processing apparatus 1 which represents a 
first embodiment of the present invention. As shown in 
Fig. 1, the HTML document processing apparatus 1 is 
connected to a World Wide Web (WWW) server 3 through 
an IP communication network, i.e., Internet 2. In this 
embodiment, the WWW server 3 holds English home pages 
(Web pages in which contents are described in English) 

The HTML document processing apparatus 1 can be 
realized by a personal computer, and has an input 
section 10, a display section 11, a display processing 
section 12, a communication control section 13, an 
automatic translation processing section 14, an 
or iginal-ver sus-ver s ion HTML document forming 
processing section 15, an individual-language HTML 
document conversion processing section 16, a data 
storage section 17, etc. The HTML document processing 
apparatus 1 is connected to a WWW browser 4 provided 
as an Internet display tool. 

The personal computer has, as is well known, a 
central processing unit (CPU) , a random access memory 
(RAM) , a hard disk, a drive unit for accessing a 
portable recording medium, such as a floppy disk or a 
compact disk-read only memory (CD-ROM), to read out a 
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program or data recorded thereon, a communication 
control unit, such as a modem or a local area network 
(LAN) board, a display, a keyboard, and a mouse. 

The input section 10 of the HTML document 
processing apparatus 1 in this embodiment corresponds 
to the keyboard and the mouse; the display section 11 
to the display; the data storage section 17 to the RAM, 
the hard disk or the portable recording medium; and 
the communication control section 13 to the 
communication control unit. 

The display processing section 12, the automatic 
translation processing section 14, the original- 
ver sus-ver s ion HTML document forming processing 
section 15, and the individual-language HTML document 
conversion processing section 16 are realized by 
making the CPU execute a program of the present 
invention and a program of Internet translation 
software . 

The program of the present invention and the 
Internet translation software are provided by being 
recorded on the portable recording medium, and are 
read out by the drive unit and stored in the hard disk 
in advance . 

The above-described program and software stored 



23 



in an external unit, may be downloaded to the HTML 
document processing apparatus 1 through the network 
using the communication control unit, and stored in 
the hard disk. 

The program of the present invention and the 
Internet translation software stored in the hard disk 
are read to the RAM to be executed by the CPU. The 
programs of the present invention and the Internet 
translation software may be directly read to the RAM 
without being stored in the hard disk, instead of 
being temporality stored in the hard disk and 
thereafter read to the RAM as described above. 

HTML documents formed by the HTML document 
processing apparatus 1 in this embodiment of the 
present invention are stored in the RAM, the hard disk 
or the portable recording medium. 

In the HTML document processing apparatus 1 
arranged as described above, the automatic translation 
processing section 14 translates an English HTML 
document (an original including sentences and figures) 
of an English home page described in English into a 
Japanese HTML document (a translated version) 
described in Japanese by using the home page 
translation functions of the Internet translation 
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software, e.g., 11 ATLAS " (trademark of FUJITSU LIMITED) 
and the original and the results of the translation 
are visually displayed so that pairs of original and 
translated paragraphs or sentences are juxtaposed in 
upper and lower positions, as described below in 
detail . 

In this translation and display process, the 
or iginal-ver sus-ver sion HTML document forming 
processing section 15 attaches a language tag, which 
is a special tag for identification of the kind of 
language, to the beginning of each of paragraphs or 
sentences which are constituent units of the original 
document and the translated document. The individual- 
language HTML document conversion processing section 
16 recognizes the attached language tag to enable 
selective display of only one of the original and 
translated documents by further adding an invisibility 
tag (visibility control tag) to instruct to make no 
visual display. 

[Operation of HTML Document Processing Apparatus] 
The operation of the HTML document processing 
apparatus 1 in the first embodiment of the present 
invention shown in Fig. 1 will now be described. Fig. 
2 schematically shows the compositions and contents of 
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original and translated documents and a state where 
these documents are stored and displayed by being 
juxtaposed in the above-mentioned manner. Fig. 3 shows 
a state where an original HTML document and a version 
of the HTML document are stored and displayed while 
being combined into one file in such a form that each 
of pairs of original and translated sentences are 
juxtaposed in left and right positions. Figs. 4 and 5 
are flowcharts for explaining the operation of the 
HTML document processing apparatus 1. 

(Original-versus-ver sion HTML Document Forming 
Processing) 

The operation will be described by way of example 
with respect to a case where, in the HTML document 
processing apparatus 1, an English HTML document 
(original) of an English home page described in 
English is translated into a Japanese HTML document 
(translated version) described in Japanese, and an 
or iginal-ver sus-ver s ion HTML document is formed from 
the original and version and is visually displayed. 

Referring also to Figs. 1 through 5, a user using 
the HTML document processing apparatus 1 searches the 
Internet for a home page described in a language which 
he or she wishes to translate, by operating the input 
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section 10 and based on the contents of a display made 
by the display section 11 (WWW browser) . In this 
example, the object of this search (processing step 
S401 in Fig. 4) is an English home page held by the 
WWW server 3 . 

Next, the user initiates a program for 
translation in the desired direction by operating the 
input section 10 and based on the contents of the 
display made by the display section 11. In this 
example, the user wishes to translate an English HTML 
document of the English home page into a Japanese HTML 
document described in Japanese and therefore initiates 
an English-into- Japanese translation program (S402) . 

The user clicks a "translation" button displayed 
in an initial window (dialogue window) (not shown) of 
the translation program to make the HTML document 
processing apparatus 1 translate and arrange the 
English HTML document in such a manner that each of 
pairs of original and translated paragraphs or 
sentences can be displayed by being juxtaposed in 
upper and lower positions (S403) . According to the 
user's "translation" button clicking operation, the 
automatic translation processing section 14 having the 
translation program read out from the hard disk, 



27 



translates the English HTML document supplied from the 
WWW server 3 through the communication control section 
13 into a Japanese document, and thereby forms an HTML 
document in which each of a certain number of English 
paragraphs or sentences are followed by a line break 
tag and a corresponding Japanese paragraph or sentence, 
that is, each of pairs of original and translated 
paragraphs or sentences are displayed by being 
juxtaposed in upper and lower positions. Known 
techniques can be used for the process of translation 
in the automatic translation processing section 14, 
and therefore such translation techniques will not be 
described in detail in this specification. 

The original-ver sus-ver s ion HTML document forming 
processing section 15, supplied with the results of 
translation performed by the automatic translation 
section 14, i.e., the HTML document containing 
Japanese sentences, starts analyzing the HTML document 
(S404) to add language tags for enabling 
discrimination between the original and translated 
paragraphs or sentences in the individual-language 
HTML document conversion processing section 16 
described below in detail. 

The or iginal-ver sus-ver s ion HTML document forming 
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processing section 15 first recognizes the part of a 
text of the inputted HTML document. The original- 
ver sus-ver s ion HTML document forming processing 
section 15 detects an HTML tag such as specifically 
shown in Fig. 6 (shown in the space between symbols < 
and > in Fig. 6) to recognize the section following 
this tag as English texts and Japanese texts (S405) . 

Next, the original-ver sus-vers ion HTML document 
forming processing section 15 determines, if the texts 
recognized in step S405 are an English paragraph or 
sentence (original) or a Japanese paragraph or 
sentence (translated version) according to the 
position of the paragraphs or sentences (in the order 
of an English paragraph or sentence, a line break tag, 
and a Japanese paragraph or sentence) (step S406) . If 
the or iginal-ver sus-ver s ion HTML document forming 
processing section 15 recognizes an English paragraph 
or sentence as a result of this determination, it 
embeds a language tag <E> at the beginning of the 
document-component-unit paragraph or sentence (S407). 
If it recognizes a Japanese paragraph or sentence, it 
embeds a language tag <J> at the beginning of the 
document-component-unit paragraph or sentence (S408) . 

The original-ver sus-ver s ion HTML document forming 
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processing section 15 proceeds to processing of the 
next paragraph in the original-versus-version HTML 
document, and makes a determination as to whether an 
HTML tag </html> indicating the end of the HTML 
document is detected or not (steps S409, S410). The 
original-ver sus-version HTML document forming 
processing section 15 repeats the above-described 
steps S404 to S410 until the HTML tag indicating the 
end of the HTML document is detected. 

The or iginal-ver sus-ver s ion HTML document forming 
processing section 15 stores in the data storage 
section 17 an or iginal-ver sus-ver s ion HTML document 
obtained as a result of the original-versus-version 
HTML document forming processing based on the above- 
described sequence of document analysis steps, when 
the HTML tag indicating the end of the HTML document 
is detected. The state of data thereby stored is shown 
as an original-versus-version HTML document source 21 
in Fig. 2. In Fig. 2, symbols A, and V indicate 
English sentences and symbols O, and © indicate 
Japanese sentences . 

After the original-versus-version HTML document 
forming processing performed by the original-versus- 
version HTML document forming processing section 15 
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based on the above-described sequence of document 
analysis steps, the display processing section 12 
visually displays the stored original-ver sus-ver s ion 
HTML document in the upper-lower juxtaposition form on 
the screen of the display section 11 (WWW browser 4) 
(step S411) . 

The or iginal-ver sus-ver s ion HTML document source 
21 is displayed in the upper-lower juxtaposition form, 
as indicated by reference numerals 20 in Fig. 2. In 
Fig. 2, " image 1" and "image2" represent figures in the 
document . 

(Processing for Selectively Displaying English 
HTML Document (Original) and Japanese HTML Document 
(vers ion ) ) 

In the HTML document processing apparatus, 
processing described below is performed after the 
or iginal-ver sus-ver s ion HTML document forming 
processing based on the above-described sequence of 
document analysis steps. The processing described 
below enables selective display of only one of the 
English HTML document (original) and the Japanese HTML 
document (translated version) displayed in the upper- 
lower juxtaposition form on the screen of the display 
section 1 1 . 
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The user determines display of the document in 
the particular language for selective display of the 
document, and clicks a button corresponding to the 
language to be displayed (a "Japanese" button in this 
example) (S412 in Fig. 5) . In response to the user's 
"Japanese" button clicking operation, the HTML 
document processing apparatus 1 hands over processing 
to the individual-language HTML document conversion 
processing section. 16 to enable selective display of 
only the English HTML document (original) or the 
Japanese HTML document (translated version) . In the 
case where the objective languages are English and 
Japanese as in this example, "English" and "Japanese" 
buttons (not shown) are displayed. 

The individual-language HTML document conversion 
processing section 16 reads out the or iginal-ver sus- 
version HTML document source 21 to the data storage 
section 17 as shown in Fig. 2, and starts analyzing 
the document (S413). The individual-language HTML 
document conversion processing section 16 makes a 
determination as to whether or not a language tag <E> 
or <J> embedded in the or iginal-ver sus-vers ion HTML 
document source 21 has been detected (S414). 

If the individual-language HTML document 
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conversion processing section 16 has detected a 
language tag in the English original-vs . -Japanese 
version HTML document, it sets an invisibility start 
tag <INVIS IBILITY START> as a visibility control tag 
immediately before the detected language tag (S415) . 

Thereafter, to enable visual display of only the 
Japanese sentences in the English original-vs . - 
Japanese version HTML document, the individual- 
language HTML document conversion processing section 
16 makes a determination as to whether or not the 
language tag <J> is detected (S416) . If the 
individual-language HTML document conversion 
processing section 16 has detected a language tag <J>, 
it sets an invisibility end tag <INVISIBILITY END> as 
a visibility control tag immediately after the 
detected tag (immediately before a translated 
sentence) ( S417 ) . 

The well-known comment tag may be used as the 
above-mentioned visibility control tag. A sentence or 
paragraph bracketed by a pair of comment tags is not 
displayed by the WWW browser 4. 

If the individual-language HTML document 
conversion processing section 16 recognizes the 
detected language tag as a language tag other than the 
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language tag <J> in above step S416, it sets an 
invisibility end tag <INVISIBILITY END> at the end of 
the paragraph (S418). This process is effective in 
processing an original-ver sus-ver s ion HTML document 
containing sentences written in three or more 
languages. For example, if the or iginal-ver sus-ver s ion 
HTML document contains German sentences (original) 
together with English sentences (original) and 
Japanese sentences (translated version), this process 
makes the German sentences invisible as well as the 
English sentences . 

An or iginal-ver sus-ver sion HTML document 22 
having invisibility tags <INVISIBILITY START> , 
<INVISIBILITY END> as visibility control tags added 
thereto as well as language tags <E> and < J> , as shown 
in Fig. 2, is formed in this manner to display 
Japanese sentences only. Fig. 2 shows this HTML 
document with source codes. 

If no language tag is detected in above step S414, 
or after the completion of processing of the step S417 
or 418, the individual-language HTML document 
conversion processing section 16 proceeds to 
processing of the next paragraph in the original- 
versus-version HTML document, and makes a 
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determination as to whether or not an HTML tag </html> 
indicating the end of the HTML document has been 
detected (steps S419, S420). The individual-language 
HTML document conversion processing section 16 repeats 
the above-described steps S413 to S420 until the HTML 
tag indicating the end of the HTML document is 
detected . 

The individual-language HTML document conversion 
processing section 16 stores in the data storage 
section 17 the HTML document with invisibility tags 
obtained as a result of the above-described sequence 
of process steps for selective display of the 
original-versus-version HTML document when the HTML 
tag indicating the end of the original-ver sus-ver sion 
HTML document is detected. 

The display processing section 12 then opens the 
HTML document with invisibility tags by the WWW 
browser 4, and visually displays the document on the 
screen of the display section 11 (S421) . A Japanese 
sentence HTML display 23 shown in Fig. 2 represents 
the state of this display on the screen. 

Further, Fig. 2 shows the source code for 
enabling display of English sentences only in an 
or iginal-ver sus-ver sion HTML document 24, and shows an 
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HTML display 25 which represents the state of display 
on the screen when the HTML document 24 is displayed 
by the WWW browser 4 . 

[Second Embodiment ] 

[Configuration of HTML Document Processing 
Apparatus ] 

Fig. 7 shows functional components of an HTML 
document processing apparatus 1 which represents a 
Embodiment 2 of the present invention. The HTML 
document processing apparatus 1 is connected to a 
World Wide Web (WWW) server 3 through an IP 
communication network, i.e., Internet 2. In this 
embodiment, the WWW server 3 holds English home pages. 

The HTML document processing apparatus 1 can be 
realized by a personal computer, and has an input 
section 10, a display section 11, a display processing 
section 12, a communication control section 13, an 
automatic translation processing section 14, an 
original-versus-version HTML document forming 
processing section 15, an individual-language HTML 
document conversion processing section 16, a data 
storage section 17, an HTML document analysis and 
conversion processing section 18, an English- Japanese 
translation editor (original-versus-version editor 
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program) 19, an or iginal-versus-version HTML document 
conversion processing section 40, etc. The HTML 
document processing apparatus 1 is connected to a WWW 
browser 4 provided as an Internet display tool. 

In the HTML document processing apparatus 
arranged as described above, the automatic translation 
processing section 14 translates an English HTML 
document (an original including sentences and figures) 
of an English home page described in English into a 
Japanese HTML document (a translated version) 
described in Japanese, by using the home page 
translation functions of the above-mentioned Internet 
translation software, and the original and the results 
of the translation are visually displayed in such a 
form that each of pairs of original and translated 
paragraphs or sentences are juxtaposed in upper and 
lower positions. 

In this translation and display process, the 
or iginal- ver sus-ver s ion HTML document forming 
processing section 15 attaches a language tag, which 
is a special tag for determination of the kind of 
language, to the beginning of each of paragraphs or 
sentences which are constituent units of the original 
document and the translated document. 
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The HTML document analysis and conversion 
processing section 18 and the English- Japanese 
translation editor (original-versus-ver s ion editor 
program) 19 extract data with respect to each language, 
when data is edited in an or iginal-ver sus-vers ion form. 
After editing of or iginal-ver sus-version data 
performed by the editor, HTML tags are restored to 
form an HTML document from the edited data. 

If an HTML document is translated by Internet 
translation software, translation results outputted in 
an or iginal-ver sus-ver s ion form cannot be edited. 
However, document data prepared by removing HTML tags 
from the translation results is supplied to the editor 
capable of editing data in an or iginal-ver sus-ver s ion 
form to enable editing of the translation results. 
Also, HTML tags are restored in edit results to enable 
the edit results to be used as an HTML document. 

[Operation of HTML Document Processing Apparatus] 
The operation of the HTML document processing 
apparatus 1 in Embodiment 2 of the present invention 
shown in Fig. 7 will now be described. Figs. 8 and 9 
are flowcharts for explaining the operation of the 
HTML document processing apparatus 1. 

( Or iginal-ver sus-ver s ion HTML Document Forming 
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Processing) 

Referring to Figs. 7 through 9, in the HTML 
document processing apparatus 1, an English HTML 
document (original) of an English home page described 
in English is translated, an HTML document including 
the formed original and translated version is analyzed 
and an or iginal-ver sus-ver s ion HTML document having 
language tags embedded therein is formed and visually 
displayed. This document forming and displaying 
processing is the same as that performed by the HTML 
document processing apparatus 1 Embodiment 1 described 
above, i.e., the process steps S401 to S414 shown in 
Fig . 4 . 

(Original-versus-version HTML Document Data 
Conversion Processing and Translated Version Edit 
Processing) 

In the HTML document processing apparatus 1, 
after the above-described processing of forming an 
original-versus-version HTML document displayed in an 
upper-lower juxtaposition form on the screen of the 
display section 11, data conversion processing and 
translated version edit processing can be performed in 
a process described below. 

When a user clicks an "original-versus-version 
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edit processing start 1 ' button, the automatic 
translation processing section 14 having translation 
software hands over processing to the HTML document 
analysis and conversion processing section 18 (S812 in 
Fig. 8) to enable editing of the Japanese HTML 
document (version) in the or iginal-ver sus-ver s ion HTML 
document 20 shown in Fig. 2. 

The HTML document analysis and conversion 
processing section 18 starts processing for enabling 
editing using the English- Japanese translation editor 
( or iginal-ver sus-ver s ion editor program) 19 by 
removing language tags from the or iginal-ver sus- 
version HTML document source 21 having language tags 
<E> and <J> added thereto (S813). 

First, the HTML document analysis and conversion 
processing section 18 makes a determination as to 
whether a language tag <E> or <J> embedded in the 
or iginal-ver sus-ver s ion HTML document in the original- 
versus-ver s ion HTML document forming processing 
section 15 has been detected (S814) . 

If the HTML document analysis and conversion 
processing section 18 has detected a language tag <E> 
or <J> in the English or iginal-ver sus- Japanese version 
HTML document, it stores in the data storage section 
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17 language data together with language tag start and 
end addresses (S815) . 

Thereafter, to enable display and edit of English 
sentences in the English original-vs . -Japanese version 
HTML document, the HTML document analysis and 
conversion processing section 18 makes a determination 
as to whether or not the detected language tag is a 
language tag <E> (S816). If the detected language tag 
is a language tag <E>, the HTML document analysis and 
conversion processing section 18 stores in an original 
document storage area in the data storage section 17, 
the start address of a document constituent unit 
paragraph (or sentence) and the contents of the 
paragraph (S817) . 

If the result of above determination in step S816 
detects a language tag < J> , the HTML document analysis 
and conversion processing section 18 stores in a 
version storage area in the data storage section 17, 
the start address of a document constituent unit 
paragraph (or sentence) and the contents of the 
paragraph (S818) to enable Japanese sentence display 
and edit. As a result of the above process steps S815 
to S818, English and Japanese paragraphs or sentences 
from which HTML tags have been removed and the start 
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and end addresses of the paragraphs or sentences are 
stored in the data storage section 17. 

If no language tag is detected in above step S814, 
or after the completion of processing in above step 
S817 or 818, the HTML document analysis and conversion 
processing section 18 proceeds to processing of the 
next paragraph in the or iginal-ver sus-ver sion HTML 
document 21, and makes a determination as to whether 
or not an HTML tag </html> indicating the end of the 
HTML document is detected (steps S819, S820). 

The HTML document analysis and conversion 
processing section 18 repeats the above-described 
steps S813 to S820 in cooperation with the English- 
Japanese translation editor (original-versus -vers ion 
editor program) 19, until the HTML tag indicating the 
end of the HTML document is detected. Fig. 10 shows an 
example of an initial window of the English- Japanese 
translation editor. 

When the HTML tag indicating the end of the 
or iginal-ver sus-ver sion HTML document is detected, the 
HTML document analysis and conversion processing 
section 18 visually displays (S821), in the window of 
the English- Japanese translation editor (display 
section 11), the results of the above-described 
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sequence of process steps for converting the original- 
ver sus-ver s ion HTML document data, i.e., English 
paragraphs or sentences (original) and Japanese 
paragraphs or sentences (translated version) from 
which the language tags and the HTML tags have been 
removed . 

The user edits the Japanese sentences according 
to his or her need on the basis of the English and 
Japanese paragraphs or sentences visually displayed in 
the left-right juxtaposition form in the window of the 
English- Japanese translation editor. The symbol • in 
the translated sentences shown in Fig. 3 designates 
edited portions. When the user clicks an "end" button 
in the English- Japanese translation editor window 
after suitably editing the Japanese version, the 
English-Japanese translation editor (original-versus- 
version editor program) 19 stores the results of 
editing in the data storage section 17, thereby 
completing the edit processing (S822). 

(Processing for Restoration to HTML Document 
Format) 

In the HTML document processing apparatus 1, 
after the above-described original-versus-version HTML 
document data conversion processing and translated 
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version editing processing, a process described below 
is further performed to enable restoration to the HTML 
document format. 

When the user clicks an 11 HTML display" button in 
the Engl ish- Japanese translation editor window, the 
English- Japanese trans la t ion editor (original-versus- 
version editor program) 19 is started (S823) . By an 
instruction from the Engl i sh- Japane s e translation 
editor 19, the or iginal-ver sus-ver s ion HTML document 
conversion processing section 40 starts analyzing the 
edited document (S824) . 

First, the original-ver sus-ver s ion HTML document 
conversion processing section 40 reads out one of the 
English paragraphs or sentences (original) or the 
Japanese paragraphs or sentences (translated version) 
stored in the data storage section 17, and makes a 
determination as to whether the document constituent 
unit, i.e., the paragraph or sentence, is English 
(original) or Japanese (translated version) (S825) . 
This determination can be definitely made based on the 
language tags stored in the data storage section 17 . 

If the result of this determination is that the 
paragraph or sentence read out is English, the 
or iginal-ver sus-ver s ion HTML document conversion 
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processing section 40 adds the language tag <E> to the 
beginning of the paragraph (S826) . If the paragraph 
read out is Japanese, the or iginal-ver sus-ver sion HTML 
document conversion processing section 40 adds the 
language tag <J> to the beginning of the paragraph 
(S827). The or iginal-ver sus-ver s ion HTML document 
conversion processing section 40 substitutes (S828) 
the paragraph of the edited document for the 
corresponding paragraph of the or iginal-ver sus-ver sion 
HTML document before editing by referring to the 
paragraph start address stored in the data storage 
section 1 7 . 

The or iginal-ver sus-ver s ion HTML document 
conversion processing section 40 then proceeds to 
processing of the next paragraph and makes a 
determination as to whether or not the end of the 
edited document has been reached (steps S829, S830) . 
The or iginal-ver sus-vers ion HTML document conversion 
processing section 40 repeats the above-described 
steps S824 to S830 until the end of the edited 
document is reached. 

The or iginal-ver sus-ver sion HTML document 
conversion processing section 40 stores in the data 
storage section 17 the or iginal-ver sus-ver s ion HTML 
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document obtained as a result of the above-described 
sequence of HTML document restoration processing steps. 
The display processing section 12 opens, by the WWW 
browser 4, the or iginal-ver sus-ver s ion HTML document 
stored in the data storage section 17, thereby making 
a display reflecting the editing results (S831) . 

When the or iginal-ver sus-ver s ion HTML document is 
displayed on the editor, the English paragraphs or 
sentences and the positions of the same, and the 
Japanese paragraphs or sentences and the positions of 
the same in the or iginal-ver sus-ver s ion HTML document 
are stored in the respective storage areas in the data 
storage section 17, and the or iginal-ver sus-ver s ion 
HTML document before editing is also held by the data 
storage section 17. From the information on the 
positions of the English and Japanese paragraphs or 
sentences stored in the storage areas, the other 
portions can be identified as the tags. In this manner, 
the positions of the tags can be obtained reciprocally 
from the positions of the English and Japanese 
paragraphs or sentences, and HTML document restoration 
processing can be performed by using the tags in the 
original-versus-version HTML document before editing 
with reference to the tag position information 
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obtained from the paragraph or sentence positions. 

The formed or iginal-ver sus-ver s ion HTML document 
can be displayed in the form of an or iginal-ver sus- 
version home page by the Internet display tool WWW 
browser 4 if it may formed without invisibility tags. 
Selective display processing (see Fig. 5) of the 
or iginal-ver sus-ver s ion HTML document may be performed 
by the individual-language HTML document conversion 
processing section 16 if necessary, as in the HTML 
document processing apparatus 1 in above-described 
Embodiment 1, thereby enabling only the Japanese HTML 
document (version) obtained by this processing to be 
visually displayed by the display section 11. 

[Embodiment 3] 

[Configuration of HTML Document Processing 
Apparatus ] 

An HTML document processing apparatus 1 which 
represents Embodiment 3 of the present invention has 
basically the same functions and configuration as the 
HTML document processing apparatus 1 of Embodiment 2 
shown in Fig . 7 . 

In the HTML document processing apparatus 1 in 
the third embodiment, however, there is no need for 
the automatic translation processing section 14 and 
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the original-ver sus-ver s ion HTML document forming 
processing section 15. 

In this HTML document processing apparatus 1, the 
HTML document analysis and conversion processing 
section 18 and the English- Japanese translation editor 
( or iginal-ver sus-ver s ion editor program) 19 cooperate 
to perform data conversion by removing HTML tags from 
an English HTML document (original) of an English home 
page described in English, thereby enabling editing of 
the original and inputting of a translated version. A 
user edits the editable English sentences according to 
his or her need and inputs Japanese sentences 
(translated version) . 

Also, the or iginal-ver sus-ver s ion HTML document 
conversion processing section 40 substitutes edited 
English and Japanese sentences for the original 
English sentences, restores the removed HTML tags, and 
sets translation-result Japanese sentences (translated 
version) in correspondence with the edited sentences, 
thereby enabling visual display of an or iginal-ver sus- 
version HTML document containing a more favorable 
version . 

[Operation of HTML Document Processing Apparatus] 
(Data Conversion Processing and Original Edit 
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Processing of English HTML Document (Original)) 

Fig. 7 shows the configuration of this HTML 
document processing apparatus 1, and Figs. 11 and 12 
are flowcharts for explaining the operation of this 
HTML document processing apparatus 1. Referring to 
Figs. 7, 11, and 12, a user using the HTML document 
processing apparatus 1 searches the Internet for a 
home page described in a language which he or she 
wishes to translate, by operating the input section 10 
and based on the contents of a display made by the 
display section 11. In this example, the object of 
this search (processing step S1101 in Fig. 11) is an 
English home page held by the WWW server 3. 

Next, the user initiates a program for 
translation in the desired direction by operating the 
input section 10 and based on the contents of the 
display made by the display section 11. In this 
example, the English home page is to be finally 
translated into a Japanese HTML document described in 
Japanese. Therefore, an English-into- Japanese 
translation program is started (S1102) . 

The user clicks an " or iginal-ver sus-ver s ion edit 
processing start" button in an initial window 
(dialogue window) (not shown) of the translation 
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program to start the English- Japanese translation 
editor ( or iginal-ver sus-ver s ion editor program) 19 
(S1103) . 

The HTML document analysis and conversion 
processing section 18 starts analyzing (S1104) the 
English HTML document (original) supplied from the WWW 
server 3 through the communication control section 13. 
First, the HTML document analysis and conversion 
processing section 18 recognizes a text portion of the 
English HTML document (original) . The HTML document 
analysis and conversion processing section 18 detects 
an HTML tag to recognize the section following this 
tag as an English text (S1105). 

Next, the HTML document analysis and conversion 
processing section 18 stores in the original storage 
area in the data storage section 17 the start address 
of a document constituent unit paragraph (or sentence) 
and the contents of the paragraph (S1106) . As a result 
of the processing in the above steps S1105 and S1106, 
the English paragraph or sentence from which the HTML 
tag has been removed is stored in the data storage 
section 1 7 . 

If HTML tag is detected in above step S1105, or 
after the completion of processing in the step S1106, 
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the HTML document analysis and conversion processing 
section 18 proceeds to processing of the next 
paragraph in the English HTML document, and makes a 
determination as to whether or not an HTML tag </html> 
indicating the end of the HTML document is detected 
(steps S1107 , S1108). 

The HTML document analysis and conversion 
processing section 18 repeats the above-described 
steps S1104 to S1108 in cooperation with the English- 
Japanese translation editor (original-versus-version 
editor program) 19 until the HTML tag indicating the 
end of the English HTML document is detected. 

When the HTML tag indicating the end of the 
English HTML document is detected, the HTML document 
analysis and conversion processing section 18 visually 
displays (S1109) in a left-hand section of the window 
of the English- Japanese translation editor (display 
section 11), the results of the above-described 
sequence of process steps for converting the English 
HTML document data. 

The user performs manual translation on the basis 
of the English paragraphs or sentences (original) 
visually displayed in the window of the English- 
Japanese translation editor, and forms translated 
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sentences (Japanese sentences) by operating the input 
section 10. In this translation processing, the user 
edits the English sentences and, if necessary the 
Japanese translated sentences according to his or her 
need. When the user clicks an "end" button in the 
English- Japanese translation editor window after 
suitably editing the original and completing the 
Japanese version, the English- Japanese translation 
editor ( or iginal-ver sus-ver s ion editor program) 19 
stores the results of editing and translation in the 
data storage section 17, thereby completing the 
processing (S1110) . 

(Processing for Restoration to HTML Document 
Format) 

In the HTML document processing apparatus 1, 
after the above-described English HTML document 
(original) data conversion processing and original 
editing processing, a process described below is next 
performed to enable restoration processing of the HTML 
document format. 

When the user clicks an w HTML display" button in 
the English- Japanese translation editor window, the 
English- Japanese translation editor (original-versus- 
version editor program) 19 is again started (Sllll). 
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By an instruction from the Engl i sh- Japane s e 
translation editor 19, the original-ver sus-version 
HTML document conversion processing section 40 starts 
analyzing the resulting edited document (S1112) . 

First, the original-ver sus-version HTML document 
conversion processing section 40 reads out one of the 
English paragraphs or sentences (original) stored in 
the data storage section 17, and makes a determination 
as to whether or not the document constituent unit, 
i.e., the paragraph (or sentence), is English 
(original) (S1113) . This determination can be 
definitely made based on the storage positions stored 
in the left-right juxtaposition form in the data 
storage section 17. 

If the result of this determination is that the 
paragraph or sentence read out is English, the 
original-ver sus-version HTML document conversion 
processing section 40 adds the language tag <E> to the 
beginning of the paragraph (S1114) . If the paragraph 
read out is Japanese, the or iginal-ver sus- ver s ion HTML 
document conversion processing section 40 adds the 

language tag <J> to the beginning of the paragraph 

(S1115) . 

The or iginal-ver sus-ver s ion HTML document 
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conversion processing section 40 substitutes (S1116) 
the paragraph of the resulting edited original with 
the language tag added thereto for the corresponding 
original paragraph of the HTML document before editing, 
by referring to the paragraph start address stored in 
the data storage section 17, and inserts the 
translated sentence formed in the above-described step 
S1110 immediately after the substituted original 
paragraph (S1117) . 

The or iginal-ver sus-ver s ion HTML document 
conversion processing section 40 then proceeds to 
processing of the next paragraph and makes a 
determination as to whether or not the end of the 
edited document has been reached (steps S1118, S1119). 
The or iginal-ver sus-ver s ion HTML document conversion 
processing section 40 repeats the above-described 
steps S1112 to S1119 until the end of the edited 
document is reached. 

The or iginal-ver sus-ver s ion HTML document 
conversion processing section 40 stores in the data 
storage section 17 the or iginal-ver sus-ver s ion HTML 
document obtained as a result of the above-described 
sequence of HTML document restoration processing steps. 
The display processing section 12 then opens, by the 
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WWW browser 4, the or iginal-ver sus-ver s ion HTML 
document stored in the data storage section 17 , 
thereby making a display reflecting the editing 
results (S1120) . 

Also in this embodiment, HTML document 
restoration processing can be performed by using the 
tags in the or iginal-ver sus-version HTML document in 
the same manner as that in Embodiment 2 . 

Also, selective display processing (see Fig. 5) 
of the original-versus-version HTML document may be 
performed by the individual-language HTML document 
conversion processing section 16 if necessary, as in 
the HTML document processing apparatus 1 in the above- 
described Embodiment l f thereby enabling only the 
Japanese HTML document (translated version) obtained 
by this processing to be visually displayed in the 
display section 11. 

[Examples of Modification] 

In the HTML document processing apparatus 1 in 
the above-described third embodiment, translated 
sentences are formed by manual translation. However, 
an automatic translation process using the automatic 
translation processing section 14 may also be 
performed in the third embodiment, similar to that in 
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the HTML document processing apparatuses in the first 
and second embodiments. Conversely, manual translation 
may be performed in the HTML document processing 
apparatuses in the first and second embodiments. 

In each of the HTML document processing 
apparatuses in the above-described embodiments, an 
original written in English, is translated into 
Japanese. However, the HTML document processing 
apparatus of the present invention can also operate in 
the same manner with respect to the respective 
languages, and can similarly process an HTML document 
having sentences written in three or more languages. 

In the HTML document processing apparatus 1 in 
the first embodiment, invisibility tags designating 
inhibition of visual display are set as visibility 
control tags in an HTML document. However, it is also 
possible to print only a document expressed in a 
particular language selected by setting such tags. 

Each of the above-described processes according 
to the present invention can be applied in association 
with a computer-readable medium. 

Although only a few embodiments of the present 
invention have been described in detail above, those 
skilled in the art will readily appreciate that many 
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modifications are possible in the preferred 
embodiments without departing from the novel teachings 
and advantages of the present invention. Accordingly, 
all such modifications are intended to be included 
within the scope of the present invention as defined 
by the following claims. 



57 



